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ON EIGENVALUE AND EIGENVECTOR ESTIMATES FOR 
NONNEGATIVE DEFINITE OPERATORS 

LUKA GRUBISIC 


Abstract. In this article we further develop a perturbation approach to the Rayleigh- 
Ritz approximations from our earlier work. We both sharpen the estimates and extend the 
applicability of the theory to nonnegative definite operators . The perturbation argument 
enables us to solve two problems in one go: We determine which part of the spectrum of 
the operator is being approximated by the Ritz values and compute the approximation 
estimates. We also present a Temple-Kato like inequality which —unlike the original 
Temple-Kato inequality— applies to any test vectors from the quadratic form domain of 
the operator. 


1. Introduction 

A perturbation approach to Rayleigh-Ritz approximation was introduced by Kalian 
in HU. The main idea is to represent the eigenvalues (vectors), which we do not know 
(but want to approximate), as perturbations of the Ritz values (vectors) which we have 
computed. This concept lies behind the standard subspace approximation theory from 
mm In our previous paper |9j we have shown a way to apply this concept to less regular 
test spaces than those which were considered in mm- In the present note we continue 
this study and both improve and generalize the perturbation estimates from [|5j. 

Let us introduce some preliminary notation. Let h be a positive definite symmetric 
form in a possibly infinite dimensional Hilbert space PL. The form h generates the positive 
definite operator H such that h(u,v) = (i/ 1 / 2 ^, i/ 1 / 2 ^). The test space for the Rayleigh- 
Ritz method will be ran (A), where A : C n —> 7i is an isometry such that ran (A) C 
£>(R 1/2 ). Set P = XX*, P ± = I - A A* and define: 

• the block diagonal part of h as the positive definite form 
h'(u, v ) = h(Pu, Pv ) + h(P±u, P±v) 

• the block diagonal part of H as the operator H' such that 

h'{u,v) = H^^v) 

• the Rayleigh quotient as the matrix S = (H 1 / 2 A)*# 1 / 2 A G C nxn . 
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On eigenvalue and eigenvector estimates for nonnegative definite operators 


The standard theory of 011 uses 

(1.1) max |(Z, Hx - H'x)\ = ||i?|| < oo, R = HX - AS = HX - H'X 


to obtain spectral estimates. The operator R is called the residual of the test subspace 
ran (A"). 

It has already been demonstrated—in 0—that Kalian’s concept can yield nontrivial 
estimates even when H — H' is not a bona fide operator, that is to say when ||i?|| = oo. 
We now continue the study from 0 and both sharpen the estimates and extend the 
applicability of the theory to nonnegative H. Our results are generalizations of the known 
estimates for finite matrices 0 EH, . A familiarity with the paper [Sj is not a prerequisite 
for this work. 

As a start we review some geometrical results from 0. Let for the moment 7i be finite 
dimensional. For the forms h and b! we have 

\(H 1 l 2 x,H l ' 2 x) - (H' 1/2 x.H' 1/2 x)\ 


( 1 . 2 ) 


(1.3) 


max ■ 


max ■ 

X 


(. x , H'x) 

\(H l / 2 x, H l / 2 x) - (H'^x, H'^x )| 
(x, Hx) 


= sin 0(iL 1/2 A, H~ 1/2 X), 

sin 0(iL 1 / 2 A, iL _1 / 2 A) 

1 — sin 0(i/ 1 / 2 A", H-V 2 Xy 


where sinO(i/ 1 / 2 A", H~ 1 ^ 2 X) is the sine of the maximal canonical angle between the sub¬ 
spaces H 1 / 2 X and H~ x l 2 X. We will slightly stretch the terminology and (colloquially) 
call m and m the energy-scaled residual measures. 

Eigenvalue estimates obtained from m are of the “absolute” type, i.e. 


(1.4) 


|A — /i| < ||i?||, 


whereas the estimates obtained from m-m are of the “relative” type 


(1.5) 


|A~hl 


< sin 0, 


l A ~hl 

A 


< - 

“ 1 


sin 0 
— sin 0 


We identify the following building blocks in (Q: 

• H and H' are considered as symmetric forms 

h(u,v) = (H^u.H^v) and h'{u,v) = (H'^u, H' l / 2 v) 

• monotonicity of the spectrum implies the estimates 

In 0 the perturbation estimate a was shown to hold for a positive definite operator 
in an infinite dimensional Hilbert space. We now prove the sharper estimate ( 11 . 21 ) for 
a nonnegative definite operator in a Hilbert space and give an alternative proof of m 
as a spinoff . We also generalize some further results which were derived from in 
Reference 0. 

The restriction ||i?|| < oo, necessary for (11.111 to give useful information in the unbounded 
operator setting, incurs ran (A) C V(H). For (11.21) and m to be applicable we only need 
to assume 

sin Q(H l/2 X,H~ l / 2 X) < 1. 

This “residual measure” will give nontrivial information even when ran(A) C T>[H X ^ 2 ) is 
such that ran(A) (jL D{H ), see JH and Section [7| of this note. 
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Notably, both approaches to measure the “residual” share the property: 

• sin GiH^X, H-^X) = 0 if and only if 
ran (AT) is an invariant subspace of H 

• ||i?|| = 0 if and only if 

ran (AT) is an invariant subspace of H 

An important feature of our theory is that it gives an abstract framework for a consid¬ 
eration of both eigenvalue and eigenvector estimates. 

To get a better feeling for the estimate (Cl consider a simple example. Set 


( 1 . 6 ) 


/!, - 


1 



100 


1 

' 100 


1 + 77 



1 

0 


0 

100 u n 


'l - 1 ' 


1 

1 


0 if 


0 1 


and e = [l 0] . We will analyze an approximation of the first eigenvalue of the matrix 
H v by the Ritz value fi e = e*H v e = 1CT 2 for i] large. 

As a starting point for developing a practical procedure to compute the estimates (d 
we use the formula 


(1.7) 


sin 2 0(H 1/2 X,H- 1/2 X) = max 

fcEran(X) 


(x, H l x) — (x, H' l x) 
(x, H^x) 


which is implicit in H Section 4.]. Since 


h'- l = 


100 0 
0 


1 + 7) 2 


H = 


100 + rj 2 rj 2 


r] 


-2 


V 


-2 


we compute, with a help of d. 


Ai (Hr,) = 

A 2 (H v ) = 
sin Q(H 1/2 e,H- 1/2 e) = 


1 + 50 if - v / l + 2500r/ 4 
100 

1 + 50 rf + y/1 + 2500 if 
100 


y/ioo^TT 


As a comparison we will use an estimate which can be obtained from the Temple-Kato 
inequality from m , see Cl below. The obtained lower bounds for Ai (H v ) are displayed 
in Table |U 

We can observe in Table El the same behavior which was showed on an infinite dimen¬ 
sional model problem from [Sj. Namely, the estimate 


(1 — sin 0)/i e < Ai, 


which is linear in sin©, outperforms the estimate 


( 1 . 8 ) 




\H v e 


H' v e\ 


Ao 




A Ai, 


which is quadratic in || H v e — H' e 
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n 

Temple-Kato 

sin © 

l 

9.998000499860e-7 

0.009004962810 

2 

0.007500015625 

0.009500623831 

3 

0.008888890261 

0.009666851698 

4 

0.009375000244 

0.009750078088 

5 

0.009600000064 

0.009800039988 


Table 1. Lower estimates for Ai (H v ) which can be obtained from the Ritz 
value /I e = 10” 2 with a use of Temple-Kato estimate and with a use of sin© 
approach. 


As an infinite dimensional analogue of m we consider the following operator. Let 
X[i,2] be the characteristic function of the interval [1, 2] C M. We consider which is 
defined by 

(1.9) (Hj / VHj /2 u)= [ (1 + V 2 X[i,2])u'v' dx. 

Jo 

and we choose 


( 1 . 10 ) 


ui{x) 


\/2sm(irx), 0 < x < 1 

0, 1 < x 


1 /2 

as a test function. Now u\ e X>(Hj/ ) but u\ 0 X>(H^) so neither of Temple-Kato estimates 
(for eigenvectors or eigenvalues) does apply since — /iwi|| = oo. 

Improved eigenvalue and eigenvector approximation estimates can be summed up in the 
following procedure 1 : 

• Let H be positive definite and let P be an orthogonal projection such that ran(P) C 
©(H 1 / 2 ) and n = dim ran(P) < oo. 

• If sin© < 1 (as defined by (ED) then there exist n-eigenvalues of the operator H 
which are approximated by the n Ritz values from the subspace ran(P) in the sense 
of (11.51) . 

• ^ i-shfe < a n+ |+^ n then the Ritz values from the subspace ran(P) approximate 
first n eigenvalues of H (counting the eigenvalues according to their multiplicities) 
and we have an eigenvector estimate. (Analogous estimates hold for any other 
contiguous spectral interval.) 


^ere we have assumed we are approximating the lower end of the spectrum. Analogous procedures 
can be formulated for other contiguous spectral intervals. 
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2. The notation and preliminaries 


The environment in this article will be a Hilbert space with the scalar product (•, •). 
The scalar product is antilinear in the first variable and linear in the second. We start 
with a closed symmetric form h(-, •) which is additionally assumed to be nonnegative 

(2.1) h[u] = h(u,u ) >0, u G Q(h). 

In the sequel when we say the nonnegative form h, we shall always mean the closed 
symmetric form h which satisfies m- The form h shall be called positive definite when 
it is closed, symmetric and there exists nth > 0 such that 

h[u\ = h(u,u) > m/i||w|| 2 , u G Q(h). 

There is also an equivalent operator version of these definitions. The selfadjoint operator 
H is called nonnegative if 

(u, Hu) >0, u G 27(H). 

Subsequently, H is called positive definite if there exists niu > 0 such that 

(u, Hu) > m H ||u|| 2 , u G f>(H). 

In this chapter we assume Q H = H, but later we shall also allow Q n to be any nontrivial 
subspace of 7 ~t. For nonnegative selfadjoint operators one defines, with the help of the 
spectral theorem, the usual functional calculus. We write the spectral decomposition of 
the selfadjoint operator H as 

H = J Ad£ H (A), 

where -Eh(A) is the right continuous spectral family associated to the operator H. When 
there can be no confusion we simply write E( A). 

The representation theorem for nonnegative forms [T21 pp. 331] implies that there exists 
a selfadjoint operator H such that X^H 1 / 2 ) = Q(h) and 

h(u, v ) = (H 1 / 2 u, H 1 / 2 u), u, v G Q(h) 

Following |Zj we call T>( H) the operator domain of H and <2(H) = X>(H 1,/2 ) the quadratic 
form domain of H. We write T> and Q when there can be no confusion. With the help of 
the spectral theorem we see that 


V(H) 

2(H) 


{ueH: ||Hu|| 2 
{u G H : h[u] = 


= j A 2 d(E(\)u,u) < oo}, 

||H 1/2 u|| 2 = [ A d(E(X)u, u) < oo}. 


In general, when dealing with the forms in a Hilbert space we shall follow the terminology 
of Kato, cf. 12|. In one point we will depart from the conventions in Till . A nonnegative 
form 


h(u, v) = (H 1/2 u, H 1/2 u) 
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will be called nonnegative definite when A e (H) > 0. Analogously, a nonnegative operator 
H such that A e (H) > 0 will be also called nonnegative definite. We will often say nonneg¬ 
ative, meaning the nonnegative definite. Now, we give definitions of some terms that will 
frequently be used, cf. mm- 

Definition 2.1. A bounded operator A : 7i —> U is called degenerate if ran(A) is finite 
dimensional. 


Definition 2.2. Let H and A be nonnegative operators. We define the order relation < 
between the nonnegative operators by saying that 

A < H 


if Q(H) C Q(A) and 

||A 1 / 2 w|| < ||H 1 / 2 u||, u E Q(H), 

or equivalently if 

a[u\ < h[u ], u E Q(h), 

when a and h are nonnegative forms defined by the operators A and H and A < H. 


A main principle we shall use to develop the perturbation theory will be the monotonicity 
of the spectrum with regard to the order relation between nonnegative operators. This 
principle can be expressed in many ways. The relevant results, which are scattered over 
the monographs 13 El, are summed up in the following theorem, see also m Corollary 

A-!]- 

Theorem 2.3. Let A = f A dE A ( A) and H = f A dEn(X) be nonnegative operators in 7i 
and let A < H. By 0 < pi < /i 2 < • • • < A e (A) and 0 < Ai < A 2 < • • • < A e (H) denote 
the discrete eigenvalues of A and H, then 

(1) Ae(A) < Ae(H) 

(2) dim En('y) < dim E A ( 7 ), for every 7 E M 

(3) p k < A fc , k = 1,2, • ■ • . 

We close this introductory section with the well known theorem about the perturbation 
of the essential spectrum. 


Theorem 2.4. Let H and A be positive definite operators. If the operator 

H 1 - A 1 


is compact then <r ess (H) = cr ess (A). 

3. The generalized inverse and angle between the subspaces 

There are many ways to express that u E Q(h ) is an eigenvector of the operator H. We 
will give a geometric characterization of this property. Assume that ||u|| = 1 and fi = h[u\. 
An elementary trigonometric argument yields 

(3.1) ||H 1/2 u — //H _ 1 / 2 u|| = O^sin0(H 1 / 2 u,H^ 1 / 2 u) = 0. 
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(IH.il) implies that u is an eigenvector of H if and only if sin@(H 1 / 2 M, H l / 2 u) = 0. The 
ability to assess the size of sin0(H 1//2 w, H ~ l / 2 u) will be central to the analysis of the 
Rayleigh-Ritz method in this paper. 

In this section we give the background information on the angles between two finite 
dimensional subspaces of a Hilbert space as given in Eununj. Basic results on generalized 
inverses of (unbounded) operators defined between two Hilbert spaces will be presented 
as well. These results will be applied to the problem of computing sin@(H 1 / 2 A, H -1 / 2 T) 
for the given positive definite H and some finite dimensional X C 2(H). 

Closed subspaces of the Hilbert space 7 i can be represented as images of the correspond¬ 
ing orthogonal projections. We shall freely speak about the dimension of the projection 
P meaning the dimension of the range of the projection P. In the case in which P is 
finite dimensional, we have another representation for the subspace ran(P). For a given 
n-dimensional subspace ran(P) C Q there exists an isometry X : C n —> 7i such that 
ran(P) = ran (A"), where P = A" A*. Therefore, ran (A) is an alternative representation of 
the n- dimensional subspace ran(P). The isometry A" will be called the basis of the sub¬ 
space ran(P). We shall freely use both representation of the finite dimensional subspace. 
P x = AA* will generically denote the orthogonal projection on the space ran(A) (for 
some isometry A : C n —> PL). 

Let ran(P) and ran(Q) be two finite dimensional subspaces of the Hilbert space hi. The 
function Z that measures the separation of the pair of subspaces ran(P) and ran(Q) will 
be called an angle function if it satisfies the following properties 

(1) Z(P, Q) > 0 and 

Z(P,Q) = 0 if and only if ran(P) C ran(Q) or ran(Q) C ran(P). 

(2) Z(P,Q) = Z(Q,P) 

(3) Z(P, Q) < Z(P, R ) + Z(P, Q) if 
dim(ran(P)) < dim(ran(P)) < dim(ran(Q)) or 
dim(ran(P)) > dim(ran(P)) > dim(ran((5)) 

(4) Z(PP, UQ) = Z(P, Q ), for any unitary U. 

In what follows we will use the following angle functions, see m 

(3.2) 0(P, Q ) = arcsinmax{||P(I — Q)||, ||Q(I - P)||} 

(3.3) ©p(P, Q) = arcsin min{ ||P(I - Q) ||, ||Q(I-P) ||} 

The function 0(P, Q) from (13.21) will be called the maximal canonical angle between the 
subspaces P and Q. The function 0 p (P, Q) from (13.31) will be called the maximal principal 
angle between the subspaces P and Q. 

The following lemma, which is a consequence of U21 Theorem 1-6.34], gives an insight 
in the behavior of the canonical and the principal angles which were defined by (El and 

O- 

Lemma 3.1. Let P andQ be two orthogonal projections such that dim(ran(P)) < dim(ran((5)) 
and let 

11^(1-2)11 <i 

then we have the following alternative. Either 
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(1) dim(ran(P)) = dim(ran(Q)) and 

sin @(P, Q ) = sin 0 p (P, Q) = ||P — Q|| < 1, or 

(2) dim(ran(P)) < dim(ran(Q)) and 

sin 0 p (P, Q) — ||P(I — Q)|| < 1- 

For most of our needs, Lemma rm describes the relation between the finite dimensional 
subspaces ran(P) and ran(Q) in sufficient detail. However, sometimes it will be necessary 
to analyze the structure of the finite dimensional projections Py = VV* and Py = UU* 
in further detail. To this end we define the canonical angles 9i,... ,9 n between the spaces 
ran(P) and ran(V) as 

(3.4) 9i = arccos cq, i = 1,..., n, 

where ai,... ,a n are the singular values of the matrix 

V*U e C mxn . 

We have assumed that m = dim ran(V), n = dim ran(P) and m < n. The canonical angles 
are related to the angle function (El through the formula, see na, 

sin Q(P v ,Pjj) = max sin 9 t . 

i 

We also dehne the acute principal angles 9\ < 9% < • • • < 91, where k < ji, as those 
canonical angles d* which satisfy the condition 0 < dj < 7t/2. Subsequently, we obtain a 
connection to the angle function (E3D through the formula 

sin 0 p (Py, P(y) = max sin df. 

i 

In dealing with the projections and degenerate operators it is useful to have a notion 
of the generalized inverse. We will use the definition of the generalized inverse of a closed 
densely defined operator in 7i from |15j . see also |121 Chapter IV. 5]. 

Definition 3.2. Let T : H —> U be a closed operator such that P(T) = 7d. The operator 
T* : U —> 7i is dehned by 

P(T f ) = ran(T) © ran(T) ± 

T t U = (T | ker ( T )J_) 1 Pran(T)M, U G P(T t ) 

and it is called the Moore-Penrose generalized inverse of T. 

The properties of the generalized inverse 2 are analyzed in the monograph In par¬ 
ticular we use the following characterization. 


2 The generalized inverses can also be defined in more general settings. Their properties are also analyzed 

in Cg. 
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Theorem 3.3 (see |Q1 Theorem 1.5.7]). Let T : TL —> U be the closed operator and let 
27(T) = hi, then Tt is the unique closed operator such that 

T t TT t = T f , on V(T*) 

TT^ = -Pran(T) |x>( T t) 

T'T = -P ker(X )x | 

where Pm is the orthogonal projection on A4. The operator T^ is bounded if and only if 
T has a closed range. 

The nonnegative operator Lh has the spectral decomposition 

H f = Jj dE( A), V(Hf) = {ueH: j y 2 d(E(X)u, u) < oo}, 
and the functional calculus implies 

Hi 1 / 2 — H 1//2 ^. 

Theorem run shows a relation between the Moore-Penrose generalized inverses and or¬ 
thogonal projections in a Hilbert space. This is precisely the reason why the generalized 
inverses will be useful in our study. 

A bounded operator W : Tt —> U is called partially isometric if there exists a closed 
subspace M. C Tt such that 

11^11 = 1 ^ 11 , uen. 

This is equivalent to 

W*W = P M - 

The set A4 = ra n(W*) C Tt is called the initial set of the partial isometry W and ran (W) C 
U is called the final set. Since ker(hP*) © ran (W) we see 

WW* = P ran(w) , 

so W* is also the partial isometry with the initial set ran(bh). We shall also use the 
notation 


W*W = P w *, 

It is obvious that 

and we have the following lemma. 


W* = W f 


WW* = P w 


Lemma 3.4. A bounded operator W : hi —>U is partially isometric if and only if 

ww*w = w. 


For the proof see |12j . 


Lemma 3.5. Let V and W be two partial isometries then 


P v Pw\\ = \\VP w \\ = \\V*W\\. 
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Proof. Using Lemma 13.41 we compute 

\\P v Pw\\ 2 = spr (P w P v Pw) = spr (WW*VV*WW*) 

= spr (W*VV*WW*W) = spr {W*VV*W) = \\V*W\\. 

In this computation we have used the identity 

spr (ABU) = spr (CAB), 

which holds for bounded operators A, B,C. □ 

4. Geometrical properties of the Ritz value perturbation 

In this section we will present a perturbation approach to the Rayleigh-Ritz approx¬ 
imation of the spectrum of a positive definite operator. The nonnegative definite case 
is technically more complex and warrants a separate section. Although this chapter is 
devoted to the positive definite case, some of the statements and definitions will be given 
in full generality in which they will be later used in the text. 

Let 0 < h be a nonnegative form and let ran (A") C Q(h) be the n-dimensional test 
space. The matrix 

s H ,x = (h 1 / 2 x)*h 1/2 x e c nxn 

will be called the Rayleigh quotient associated to the basis X. When there can be no con¬ 
fusion, we shall denote the Rayleigh quotient by S and drop the indices. The eigenvalues 
of the matrix S will be numbered in the ascending order 

(4.1) /A < R 2 < ■ ■ ■ < AG- 

We call the numbers /h the Ritz values of the operator H (form h ) from the subspace 
ran(X). This definition is correct since the eigenvalues of the matrix S do not depend on 
the choice of the basis X. In the rest of this chapter we will use P = XX* to denote the 
projection onto the range of the isometry X : C n —» hi. 

For the given h and ran(A") C Q(h ), P = XX*, we define the symmetric forms Sh and 
h' using the formulae 

(4.2) 5h(u, v ) = h(Pu , (I - P)v) + h({ I - P)u, Pv), u, v e Q(h) 

(4.3) ti{u,v) = h(Pu,Pv) + h((I- P)u,(I-P)v), u,v e Q{h). 

Obviously, (El and El imply 

(4.4) h'(u,v ) = h(u,v ) —8h(u,v), u,v E Q{h). 

Before we can proceed we need the following definition. 

Definition 4.1. If H is a selfadjoint operator and P a projection, to say that P commutes 
with H means that u E Z>(H) implies Pu E Z>(H) and 

H Pu = PHu, u E V(U). 

In what follows we will describe the properties of the symmetric form h' and of the 
operator H' it generates. 
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Lemma 4.2. Let the nonnegative definite form h and the subspace ran(X) C Q be given. 
Let H be the nonnegative definite operator defined by the form h. The form h! from is 
closed and positive and it defines the self adjoint operator H 7 . Furthermore. H 7 is positive 
definite if H is positive definite, cr es <j(H) = <T ess (H 7 ) and 

(4.5) H'X = XE, 
for E = (H ^ 2 X)*H^ 2 X G C nxn . 

Proof. The operators H 1 / 2 P and H 1//2 (I — P) are closed and so is the form 

ti{u, v) = h(Pu, Pv ) + h(( I - P)u, (I - P)v). 

It is obviously nonnegative, so it defines a nonnegative selfadjoint operator H 7 . We will 
now show that the subspace ran(X) reduces H 7 . Indeed, for y G Q, x G C n we have 

ti(y, Xx) = (H 1/2 y. U 1/2 Xx) - (H 1/2 (I - P)y, H 1/2 Xx) 

= (H 1/2 XX*y, H 1/2 Xx ) 

= (ZX*y,x). 

This is equivalent to 

(H V 2 y, K^Xx) = (y, XEx), y G Q, x G C n . 

It implies ran (A") C V(H') and 

(y. H'Xx - XEx) = 0, 

for all y G H, x G C n . Hence, 

(4.6) H'X = XS 

which is equivalent to the statement that P commutes with H 7 (see Definition EH). We 
now prove that cr ess (H) = cr ess (H 7 ). Assume h is a positive definite form, then h' from 
(EH is positive definite, too. From EH we obtain 

H'- 1 ^) = (H'-X - H"X, v), u, v G H. 

On the other hand 

5h( H“X, H'"X) = (H 1/2 PH“X, H 1/2 Pj.H'“X) + (H 1 /2 Pj.H“X, H 1 /2 PH'“X) 

defines a compact operator. Theorem 12.41 implies cr ess (H) = <T ess (H 7 ) and the statement of 
the theorem is proved for a positive definite h. In the general case, take a > 0. The form 
h(u,v) = h(u,v) + a(u,v ) is positive definite. Furthermore, we establish 

h\u, v) = a(u, v) + h'(u, v ) 

5h(u, v ) = Sh(u, v), 

so ay,;.,(H) = caXH 7 ). The conclusion cr eS s(H) = cr e ss(H 7 ) follows by the spectral mapping 
theorem. □ 

Corollary 4.3. Let the nonnegative definite form h and the subspace ran(X) C Q be 
given. The projections P = XX* and P ra n(H') commute and ker(H 7 ) C ker(H). 
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Remark 4.4. For positive definite h Lemma 14.21 describes the operator H' in sufficient 
detail. For a general nonnegative h the operator H' has somewhat more complex structure. 
Further properties of the operator H', constructed in the case in which h is a general 
nonnegative form, will be discussed in Section 14. II 


We now concentrate on the positive definite case. 

Theorem 4.5. Let the subspace ran(A") C Q be given and let h be positive definite. Assume 
sin© := sin©(H 1 / 2 X,H- 1 / 2 X) < 1, then 

(4.7) (1 — sin©)^'^] < h[u\ < (1 + sin©)/i'['u], u G Q(h) 

(4.8) (1 - -^—fiL-)h[u] < h'[u] < (1 + T ^-r- F r)h[u], u G Q{h). 


1 — sin © 1 — sin ©' 

Proof. The product H 1//2 H'“ 1//2 is well defined since Q = ©(H 1 / 2 ) = X^H' 1 / 2 ). This 
implies that the form 

5h s (x,y) =5h(H' 1 / 2 x,H' 1/2 y) 

defines the bounded operator 5H S . After the substitutions u = H' _1//2 x, v = H /_1 / 2 i/ we 
obtain 

1 6h(u, u)| 


(4.9) 


max 


= \\SH s 


u,veQ(h) ^Jh'[u\h'[v\ 

We now show ||5R S || = sin©. Set 

(4.10) V = H 1/2 PH'- 1/2 

(4.11) W = H 1/2 P ± H 7 - 1/2 , 
with Pj_ = I — P. Relation (14.41) implies 

dhfH'-^u, K'- 1/2 v) = h{P ± U'~ 1/2 u, PU'-^v) + h(PK'~ 1/2 u, P ± B.'~ 1/2 v ) 

(4.12) = (Wu, Vv) + (Vu, Wv), 
which can be written as 


(4.13) 5H S = V*W + W*V. 

The equations (I4.10I) - (I4.13I) yield 

(4.14) VW* = WV* = 0 

(4.15) ||APJ = \\W*VV*W + V*WW*V\\ = \\V*W\\. 

As the next step we establish that V and W are partial isometries such that 

(4.16) ran(R) = ran(H 1 / 2 P) 

(4.17) ran(W) ± = ran(H" 1 / 2 P). 

The proof will follow from Lemma 14.21 It runs along the same lines in both cases, so 
we will only present the proof for W. Take some u,v G 74, then 

(Wu, Wv) = (H 1 / 2 P ± H'- 1/2 -u, H 1/2 P ± H / - 1 / 2 n) 

= X(Pj_H /_1 / 2 rt, Pj_H /_1 / 2 n) = //(P ± H'' 1/2 u, P ± H'- 1/2 n) = (P±u,v), 
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so W*W = P±. This proves that W is a partial isometry. 

Relation (14.1(111 is obvious, since 

ran(H 1/2 PH'- 1/2 ) = ran(H 1/2 P) 

is guaranteed by the assumption ran(P) C Q{h ) and the injectivity of H ,_1 / 2 . 
The proof of (14.171) requires a bit more work. One computes 

W*a~ 1/2 P = H ,-1/2 Pj_H 1/2 HP 1/2 P = 0, 

which implies 

ran(H“ 1/2 P) C k er(W*) = ran(1T) ± . 

On the other hand 


(4.18) 


W* = P±A, 

7i is a homeomorphism (of linear topological vector spaces), 


where A = H'- 1 / 2 !! 1 / 2 : H 
so 

dim ker {W*) = dim ker(Pi) = dim ran(P) = dim ran(H~ 1//2 P) 
and (1X171) is established. The assumption sin© < 1 and Lemma 1X71 guarantee 

sin© = ||y*W||. 

Finally, using m we establish 

(1 — sin Q)h'[u\ < h[u] < (1 + sin Q)h'[v], 
which is the statement (HZZD- 

It is a well known fact that given some 0 < A, /i and 0 < ij < 1 the implication 


(4.19) 


|A-/i| 


< r] 




< 


V 


fi A 1 — T] 

holds. Since h and h' are positive definite forms, the relation (0IHD is proved. 
Example 4.6. Let — d xx be considered as the selfadjoint operator with 

v{-d xx ) = {ue H 2 [ 0,1] : w(0) = u(l) = 0}. 


□ 


The partial integration establishes that — d xx is defined by the positive definite form 

(4.20) h(u,v) = f d x u d x v dx, u,v G Q(-d xx ) = Hq[0 } 1], 

Jo 

The operator d x u, u E Hq[ 0,1] is closed, but not selfadjoint, therefore (14.201) is an alter¬ 
native operator representation (factorization), to the “square root” representation (14.2111 
of the form h (the operator — d xx ). 

Take any positive definite form h , then 

(4.21) h(u,v) = (H 1/2 u,H 1/2 n) 
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is only one of the possible operator representations of the form h. All of the preceding 
results are independent of the choice of the operator representation h(u,v ) = (R/u, R/u), 
since 


(4.22) 


sin 0 


max 

u,v£Q 


1 5h(u, n)| 
a/ h! [u] h! [n] 


and hi depends only on h and ran(P). 

Furthermore, all of the representations of the form h are in a sense equivalent. Let 
R : hi —> Li' be a closed operator such that 


(4.23) 


h(x, y ) = (Re, By) = (H 1/2 x, H 1/2 y) 


and Q = V(B) = P(H ly/2 ), then by E Ch. VI.7] 

(4.24) R = I/H 1/2 , R* = H 1/2 [/*, 


where U is the isometry from TC onto ran(R). Independence of the estimate (El) from 
the representation El could have also been proved by the unitary invariance of the 
canonical angle and ED) . Formula (14.2211 is an important corollary of Theorem 14.51 In 
the next theorem we prove that also, 


(4.25) 


sin© \8h(u,v)\ 

- - : —— = max — 

1 — sin 0 u,vgq y/h[u]h[v] 


holds. Equations ■ n~ 22 l > and (14.2511 demonstrate that the constants sin© and 1 sl ^ e in (14.71) 
and ED cannot be improved upon. 

The following lemma is taken out of the joint paper ||, cf. {5j . We present it here 
without a proof. 


Lemma 4.7. Let the form h be positive definite and let the forms hi and 5h be as in 
then 


(4.26) 


max 

u,v£Q 


1 5h(u, u)[ 
y/h[u\h[v] 


sin 0 


1 — sin 0 


holds. Here sin© = sin0(H 1 / 2 X H^X), where ran(X) C Q was the subspace used to 
define h' and 5h. 


4.1. The nonnegative definite case. In the nonnegative case we have to provide an 
alternative definition for a subspace that will play the role of ran(H _1 / 2 A"). We have 
shown W = H 1 / 2 Pj_H ,_1 / 2 to be a partial isometry such that 

W = ran(H 1/2 P x ) ± = ran(W) ± = ran(H~ 1/2 X). 

The left part of the equality is also well defined in the case in which H 1 / 2 is not invertible, 
so we set 

W = ran(H 1/2 P ± ) ± . 
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The construction (Q was performed with the assumption that h is nonnegative definite 
and ran(X) C Q. Lemma 14.21 says cr ess (H) = cr ess (H') so H^/ 2 is a bounded operator and 

(4.27) V = H 1/2 PH' tl/2 , 

(4.28) W = H 1/2 P ± H' tl/2 , 

are everywhere defined. Corollary 14.81 enables us to conclude that ran(C) = ran(H 1 ^ 2 P) 
and ran (W) = ran(H 1 / 2 Pj_) , so we set 

(4.29) V = ran(V), W = ran(lT) ± . 

Lemma 14.21 states that given a positive definite H the constructed operator H' must 
always be positive definite. In general nonnegative situation we have only the result of 
Corollary 14.81 It establishes that H' is a nonnegative definite operator and that ker(H') C 
ker(H). This does not give sufficient information on the structure of FF. Formulae like 
m-m are meaningful in the nonnegative definite case, too. They, however, invariably 
imply ker(H) = ker(H'). We, therefore, proceed is two steps. Firstly, we establish a general 
(theoretical) condition on the subspace X = ran(P) which guarantees that ker(H) = 
ker(H'). As the second step we give a practical computational formula. 

The subspaces W and V need not have the same dimension, so we will have to use the 
principal angle to compare them, cf. Lemma fd.il In what follows we show that 

sin © P (V, W) 

takes the role of sinO(H 1,/2 X, H _1 / 2 X) in the nonnegative version of Theorem 14.51 In the 
case when H 1//2 is invertible (14.171) implies V = ran(H 1//2 A") and VV = ran(H 1 ! 2 X). The 
subspaces H l/ 2 A and H 1/ 2 A have the same dimension, so Corollary 13. 1 1 yields 

sin © P (V, W) = sin0(H 1/2 A,H- 1/2 A). 

We establish the properties of V and W and give a characterization of the subspace W 
in the following lemma. 

Lemma 4.8. Let A = ran(P), V = H 1 / 2 PH' t1 / 2 and W = H^P^H'* 1 / 2 then 

(4.30) V*V = Pran(H'P) 

(4.31) W*W = P ran(H ' P± ) 

(4.32) WV* = VW* = 0, 

(4.33) W = inv(H 1/2 )A, 

where W is from E ^ ) and 

inv(H 1/2 )A = {x : H 1/2 x G X} 

denotes the inverse image of the subspace X under the mapping H 1 / 2 . 

Proof. The relations (14. 3 01) - (14. 3 21) follow analogously as in the proof of Theorem 14.51 It 
only remains to prove (lOH . 

We first show that inv(H 1 ' /2 )A C W = ran {W) L . Take any u G inv(H 1//2 )A, then 

H 1/2 « = .~gL 
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This implies 

0 = (z, P ± H' n / 2 v) = (u, H^ 2 P ± H' fl / 2 v), veH 

which proves u G ran (W) 1 - = W. 

The other inclusion follows in two steps. Take u G W, then 

(«, H 1 / 2 P ± H tl/ 2 u) =0, v G H. 

On the other hand, the subspace 

ran(P J .H't 1 /2)^ = ran(P ± P ran(H p) ± C V^ 1 ' 2 ) 

is finite dimensional, so we conclude u G X^H 1 / 2 ). Corollary 14.01 implies 

0 = (H l ' 2 u, P ± P ran( h')u) = (H 1/2 u, P ran{ w)P±v) = (H P ± v), veH, 

which proves H 1//2 n G X. With this conclusion we have established (14.TIP . 


□ 


As a direct consequence of Corollary Id. II and (14.441) we obtain the following result. 
Corollary 4.9. Let X = ran(P), V = H 1 / 2 PH't 1 / 2 and W = H 1 / 2 P ± H't 1 / 2 then 

\\PvPyv4 < H-fv-L-fWlI; 


so 

(4.34) sin0 p (H 1/2 A,inv(H 1/2 )A) = ||C*W||. 

It would be pleasing to use H 1 / 2 ! in the place of inv(H 1//2 ). This is only possible under 
additional restrictions on the subspace ran(P). To get better feeling for the meaning of 
sin0 p (H i/,2 T, inv(H 1 ' /2 )A) consider the following example. 


Example 4.10. Take 


then 


H = 


1 1 
1 1 


A = 


1 

0 


H' = 


1 0 
0 1 


r 1 

1 -| 
f 

, P 1/2t = 

- 1 

2^2 

1 - 

2^2 

\-V2 



-2 pi 

2pl- 


is, unlike H, a positive definite matrix. Now, 

H 1/2 = 

and we compute 

ran(E) = span{[l 1 ]*}, ran(W) ± = span {[—1 1 ]*} 
which proves that in this case sin0 p (ran(C), ran(W) ± ) = 1 and 

ran(W) ± = ker(P) ^ ran(H 1/2 ^P). 
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Instead of advocating the use of the general formula we will establish a “com¬ 

patibility condition” under which we may use the generalized inverse of H 1 / 2 to check the 
statement of the theorems. 

The next result is a nonnegative analogue of Theorem 14.51 It will enable us to, in effect, 
“deflate away” the kernel of the nonnegative form h and reduce the problem to the positive 
definite case. 

Theorem 4.11. Let the subspace X = ran(P) C Q be given and let h be a nonnegative 
form. Assume sin@ p (H 1/,2 T, inv(H 1 / 2 )T) = sin@ p < 1 then 

(4.35) (1 — sin Q p )h'[u] < h[u\ < (1 + sin0 p )h'[u], u G Q(/r), 

(«6) (i _ + sm^ „ e Q(h). 

1 — sm 0 p 1 — sm 0 P 

Proof. The proof is similar to the proof of Theorem 14.51 Let h! and Sh be as in (14.41) . Set 
5H S to be the operator defined by the form 

8h a {x, y ) = 5h( H' fl / 2 x, H ' n,2 y), x,yeH. 

The form Sh s is closed and everywhere defined, so 5H S is a bounded operator. We obviously 
have ker(H P/ 2 ) = ker(H 7 ) C ker (5H S ), so P ran(H ') commutes with the operator 5H S . With 
the use of Corollary 14.31 one computes, analogously as in Theorem 14.51 

5h{ H' tl/ 2 x, H ' n/2 y) = h(P ± U' n / 2 x , PH.' n/2 y) + h(PB.' n/2 x, P ± H' n/2 y ) 

= ( Wx,Vy) + (Vx, Wy ), 


so 


Since H' 1 / 2 H't 1 /2 


(4.37) 


5H S = V*W + W*V. 

-Pran(H') we obtain 

1 5h(u, u)| 


max 


M,i;eran(H')nS ^h'[u]h'[v\ 


= || 8H S 


\\vw\\. 


Corollary 14.91 implies that the assumption sin0 p < 1, in fact, reads 

sin@ p = \\V*W\\ < 1. 

With this in hand, we have established 

(1 — sin0 p )/i , [u] < h[u] < (1 + sinOp)/! 7 ^], u G Q{h), 

which implies ker(H ; ) = ker(H). The relation (14.3(11) follows by the same argument as the 
one used in Theorem l-TKl □ 


The main insight into the structure of the operator H 7 , gained from Theorem 14.1 11 is 
summed up in the following corollary. 

Corollary 4.12. Take a nonnegative form h and a subspace X = ran(P) C Q. If 
sin0 p (H 1 / 2 T,inv(H 1 / 2 )T) < 1 then ran(H 7 ) = ran(H). 
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Corollary 14.121 gives precise meaning to the statement “deflate away”. Set TZ = ran(H) = 
ran(H') and J\f = ker(H) = ker(H / ). The projections Py and P commute, so 

P^nran(P) Pj\fPi P P Pj\TC\ran(P) 
are both orthogonal projections. A direct calculation shows 

A? : = ran(P) = ran(P) © (A/'n ran(P)) = ran(H') D ran(P) = ran(H'P). 

The form 

h(u, v ) = h(P n u , P n v) 

is positive definite in TZ and ran(P) C Q(h ) fl TZ. Now, apply the construction (14.211 - (14.41) 

to the form h and the projection P. By H : TZ — > TZ denote the operator defined by the 
form h in TZ. then ran(P) C TZ and 

h'(u,v) = h'(Pjzu, Prv). 

We conclude that 

sin 0(H 1 / 2 A,H^ 1 / 2 A) = sin @ p (H 1/2 A , inv (H 1 / 2 )A) < 1 

and h and P satisfy the assumptions of Theorem 14.51 If we were to “a priori” assume 
ran (H 7 ) = ran (H), then this argument would give an alternative proof of Theorem 14.111 
“Deflate away” means that we assume we were given h and P as input. 

Remark 4.13. Another consequence of Corollary 14.121 is that we can invoke Lemma 14.71 to 
conclude that the constant (in (14.3(11) 1 cannot be sharpened. Furthermore, Example 

14.101 shows that the assumption 

sin @ p (H 1/2 A , inv (H 1 / 2 )A) < 1 

is a necessary requirement to establish the inequalities (TQ5D and JOED as well as to 
guarantee that ran(H) = ran(H') (equivalently ker(H) = ker(H 7 )). 


4.1.1. Important special case. The assumption that P and Pk ei -(H) commute and Corollary 
14.31 yield ker(H) = ker(H') and ran(H) = ran(H'). This implies 

(4.38) inv(H 1/2 )A = H 1/2t A. 

The projections P and Pk er (H) certainly commute when ker(H) _L ran(P) or when 3 ker(H) C 
ran(P). This discussion is summed up in the following corollary. 


Corollary 4.14. Assume P = XX* and Pk er (H) commute and let 

sin0 p (H 1/2 X,H i/2t X) < 1, 

then 


(4.39) 

(4.40) 


( 1 - 


1 — sin 0 p 


3 The other situation when P and Pker(H) commute is when ran(P) C ker(H), this situation is however 
trivial and we have tacitly left it out. 
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Remark 4.15. To assess the restriction that P and -Pker(H) should commute, consider the 
definition of the relatively accurate approximation of the number A G R+. /i G M+ is 
relatively accurate approximation of A G M + , if 

(1) A = /i, when A = 0 

( 2 ) < i ; w hen A ^ 0. 

This implies that we can expect to compute “relatively accurate” Ritz value approximation 
of the spectrum of the nonnegative definite operator H only in the case when we have 
computed a basis for ker(H), cf. [T]. 

Remark 14.151 implies that we may assume that the condition of Corollary 14.141 were 
ker(H) T ran(P). To compute the basis of the set inv(H 1/2 )T’ we need to repeatedly solve 
the equation 

H 1/2 w = Xi, i = 1, dim(T’). 

The vectors Xi are assumed to be a basis for X. The restriction that ker(H) T ran(P) 
amounts to nothing more then to impose a compatibility condition on Xj (e.g. think of 
the Laplacian with Neumann boundary conditions). 


4.2. A first approximation estimate. Theorem 12 . 41 and Lemma ld. 21 yield the first eigen¬ 
value estimates. The next theorem will give an eigenvalue estimate with the minimum 
of the restrictions on the subspace ran (A") C Q. Sharper bounds are possible when we 
impose additional assumptions on ran (A). Even this (first order) estimate will compare 
favorably with other higher order bounds that can be found in the literature, cf. f|j. 


Theorem 4.16. Let 0 < h and let the n-dimensional subspace ran(P) C Q, P = XX*, 
be given. Define 

E = (H 1/2 A)*H 1/2 A, S G C nxn 

and assume p n < A e (H). Here, the Ritz values are numbered as in {HP- If ran(P) is such 
that sin@ p < 1, then there are n eigenvalues of the operator H, counting the eigenvalues 
according to their multiplicities, such that 


(4.41) 

(4.42) 


I Aq - Hi | < Ip sin 0 P , 
sin 0 

IA ij ~ Hj\ — ^ 


1 — sin @ p ’ 


j = l,...,n 


where it.) : N —> N is a permutation. 


Proof. Corollary 14.121 readily implies the conclusion (14.4111 for the Ritz values [ij = 0? 
j — 1,..., dim(ker(S)). Therefore, we may safely assume that h is a positive definite form. 
Lemma 14.21 implies cr ess (H) = cr ess (H / ), so the assumption /i n < A e (H) guarantees that ptn 
is a discrete eigenvalue of H'. Theorem 14. 1 II established 


(1 — sinQp)//^] < h[u\ < (1 + sinQp)^ 7 ^], u G Q(h) 


( 1 - 


sin 0 r 


1 — sin 0 


-)h[u] < h'[u\ < (1 + 


sin 0 r 


1 — sin 0 


-)h[u\, u G Q{h). 


The conclusion follows directly from Theorem 12.41 


□ 
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For the numerical evidence concerning the performance of the estimate O' see the 
numerical tests from j]|. 

5. Localizing the approximated eigenvalues 

There is a multitude of ways to match the computed Ritz values to a part of the spectrum 
of the operator H of the same multiplicity. These approaches usually differ with regard 
to the allowed amount of additional information about the spectrum of the operator H. 
Here, we present two possible answers to that problem. 

Theorem 14.1(11 can be interpreted as a first localization result. It gives an estimate of 
the infim n m of 

\^ij ~ fJ-j | 


max 




over all of the permutations h.) : N —► N. So, we would be correct in stating that the Ritz 
values are approximating the eigenvalues of H that are closest to cr(H). 

Having only limited additional inbrmation we got a limited answer. We know that there 
is a collection of eigenvalues of operator H, having the joint multiplicity n, that is being 
approximated by the Ritz values from the subspace ran (A). The information we have on 
the location of those eigenvalues in the spectrum of H is only that they are the eigenvalues 
closest to computed Ritz values. 

Only when we have additional information about the location of the part of the spectrum 
we do not want to approximate , we can guarantee that we are approximating the part of the 
spectrum we are interested in. A best known example of such estimates is Temple-Kato 
inequality. Assume Ai < A 2 and let u G T>(H) be a unit vector such that (u. Hu) < 7 < A 2 
then 

(Hu, Hu) - (u, Hu ) 2 


(5.1) 


(u, Hu) > Ai > (u, Hu) — 


7 — (u, Hu) 

For a proof see [TE] , The estimate (15.11) is valid for a general selfadjoint operator H. 
In the following we shall formulate another assumption with the same effect, namely to 
separate the “unwanted” component of the spectrum from the Ritz values. Our result, 
however, does not need the regularity constraint u G T>{ H). Moreover, we will obtain 
sharp bounds for the matching cluster of eigenvalues. In the last section of this chapter we 
will demonstrate that on some examples our bound considerably outperforms the estimate 

(EU). 

We now give a theorem that determines those eigenvalues of the operator H, given by 
a symmetric form h, which are approximated by the Ritz values associated with the test 
subspace ran (A") C Q. Before we proceed with the formulation of the theorem we state a 
well known fact that given 0 < A,/r and sin0 p < 1 the relation 

|A — hi 


implies the relation 
(5.2) 


IA — Ai | 




< 


< sin 0„ < 1 


sin 0 r 


1 — sin 0, 


< 2 sin 0 r 
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Theorem 5.1. Set'fr = min {(Ap — Hk){\ + /x^) - 1 1A; = 1, ...,n; p = n + l,...,oo} and 
r]e p = sin@ p (l — sin@p) _1 . Take a nonnegative form h and the subspace ran(A") C Q. 
Assume r = dim(ker(H)) < n, set P = XX* and let h! be as in By pii <■■■< pL n , 

denote the eigenvalues of the matrix S = (H 1 / 2 A)*H 1 / 2 A e C nxn . If y r > 0 and t]q p < 
min{ 7 r , 1} then 

(5.3) |Aj - Hi\ < ^ sin Q p , i = l,...,n. 


Proof. The assumption r/e p < min{ 7 r , 1} and Theorem 14.111 imply ker(H) C ran(X). 
Also, by Theorem 14 .11 1 we have ker(H) = ker(H'), so we are allowed to “deflate away” the 
kernel of H. Therefore, set P\ = P ra n(H'P) and proceed as if h were positive definite and 

P = Pi- 

The rest of the proof is completely analogous to the proof of 0 Theorem 5.1]. The only 
difference is that in the place of 77 = sin@ p /(l — sin@ p ) from |9J Theorem 5.1] one uses a 
sharper quantity sin0 p . 

If we are provided with the information that 


(5.4) 


m. 


sin @ p 
1 — sin @p 


< 7 c := min 


hk Ap 

mm ---, mm 

k=l,...,n Ap + /ifc k=l,...,n 
p=l,...,q—l p=q-\-n,..., oo 


^ p ^ 

Ap + ’ 


then Hi < ■ ■ ■ < ii n approximate the “inner” eigenvalues 


A q A Ag_|_2 A Ag_|_ n _i . 

This statement is made precise in the following theorem. 


Theorem 5.2. Take a nonnegative definite form h and a subspace ran(A") C Q. By 
Hi < • • • < Hn denote the eigenvalues of the matrix S = (H 1 / 2 A)*H 1 / 2 A G C nxn . If 
t]q p < 7 0 where 7 C is as in then ran(P) C ran(H') and 

| Aj_)_g_i Hi I — hi sill Up: ^ 1,..., n . 

Proof. The assumption (15.411 and Theorem 14.1 II and Corollary 14. 121 imply 

ran(H') = ran(H) and ran(P) C ran(H). 

The rest of the proof follows analogously as in the proof of Theorem 15.11 □ 


Remark 5.3. Theorems 15 . II and 15 . 21 imply that the spectrum of the operator H can stably 
(sensibly) be divided in two disjoint parts: the part that is being approximated by the cr(S) 
and the rest of the spectrum. To understand this statement assume that the conditions 
of Theorem rm hold. In this case both of the “block diagonal” forms 


h(u,v) = h(E(\ n )u, E(X n )v) + h(E(X n ) ± u, E(X n ) x v) ~ 
h'(u, v ) = h(Pu , Pv) + h(P±u, P±v) — 


A 


A r 


have “diagonal blocks” with disjoint spectra. We have assumed A = diag(Ai,..., A„) and 
5 = diag(//i,..., Hn) and H c and A c were unbounded operators defined by the forms h' 
and h in the spaces ran(PjJ and ran(P(A n )j_). In fact, we will colloquially call h! the 
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block diagonal part of the operator H with respect to the subspace ran(P). We will use the 
notation hp to denote h' in situations when it is not clear with respect to which test space 
ran(P) was this construction performed. 

6. Eigenvector approximation estimates 
For the computed Ritz values 

0 , 0 , . . . , 0 , fi r - 1 _ 2 , • • • , P'n 

Theorem 14. 161 guarantees the existence of the eigenvalues 

^*1 — ^*2 — ' ' ' — ^inl 

that are being approximated by the Ritz values (provided sinB p < 1) in the sense of 

Wj - lh < hj sin © P , 3 = !,■■■, n. 

Assume v\,...,v n are mutually orthogonal eigenvectors that belong to the eigenvalues 
Ajj < ' A ^2 ^ ^ A in • If the conditions of Theorems IQ and 15.21 are satisfied Remark 15.31 

assures us that 

spanjvi, ...,v n } = ran(P({Aj 1 , Aj 2 , • • • ,A in })). 

Here we have assumed that H = f A dP(A). To ease the presentation we generically use 

E = ^({Ajj, Aj 2 , • ■ • , A in }) 

to denote the projection on the subspace that is selected by a result like Theorem 15.11 
The central role in the analysis of the eigenvector approximations will be played by the 
following lemma. 

Lemma 6.1. Let h be a nonnegative form and let H be bounded. Take ran(P) C Q such 
that sin@ p < 1 and define 

s(x, y) = 5h(W 1 / 2 x, H ' r ^ 2 y), x,y eH. 

The form s defines a bounded operator S and 

(6.1) S = H 1/2 H' tl/2 - Hi 1 / 2 H /1 / 2 

(6.2) \(x,Sy)\ = K:r,y)| < , Up _ ||x||||y||, x,y G H. 

yf 1 — sm 

Proof. The closed graph theorem implies that the operator 

S = H 1/2 H' tl/2 - Hi 1 / 2 H /1 / 2 

is bounded. Also, ker(H) = ker(H') = ker(S') and Pk er (s) commutes with S. It is sufficient 
to prove the estimate for x,y G ran(H). The inequality (14.371) gives 

\5h(H t ^ 2 x,H' Jfl ^ 2 y)\ <sin0 p ||y|| h'[H. r ^ 2 x] 1 ^ 2 . 


H ,1 / 2 Ht 1 / 2 II <_ ^ 

— y/1 — sin Bp 


Analogously, (14.351) implies 
(6.3) 
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Altogether, the estimate (El follows. 


□ 


(6,4) 


The operator S has the special structure. Assume H 'u = /iu and Hu = Xv, then 

(u, Su) = A 1 // 2 (u, u)/! 1 ^ 2 — A _ 1 ^ 2 (u, u )/x 1//2 

A — n / 


\/\jl 


(V,UJ 


The equation El introduces the distance function 

A — ju 


vv 

that measures the distance between the Ritz values and the spectrum of the operator 
H. This distance function will feature in the important role in the estimates that follow. 
The next theorem extends the scope, as well as strengthens the eigenvector estimate from 
pin and is even new in the matrix case. It can be seen as the eigenvector companion 
result of Theorem 14.161 


Theorem 6.2. Let h be a nonnegative form, and let ran(P) C Q be such that it satisfies the 
assumptions of Theorem 4-lb\ Let u\,... ,u n the mutually orthogonal eigenvectors belong¬ 
ing the eigenvalues fi\,..., p, n ofU'P, then there exist mutually orthogonal eigenvectors 
Vi,... ,v n of H, belonging to the eigenvalues A,,,... \ in and 


(6,5) 

The eigenvalues A,., j 

EZ3 ' 


y/2 sin 0 P 

m — uA\ < —= max 
J J y/1 - sin Q p 

1 ,,r are numbered in the ascending order as given by Theorem 



Proof. Assume pi = • • • = p r 
so we take 


0. Corollary 14. 121 implies that W; G ker(H) for i 
Vi — Ui, % — 1 , • ■ • , r. 


1 ,- 


r 


For Vj, j — r+1,... n take any orthonormal set of eigenvectors belonging to the eigenvalues 
A j., j = r + 1,... ,n. Since both Ui and Uj, for i — r + 1,..., n are perpendicular to ker(H) 
we may assume that H is positive definite and we are only given u^, i — r + 1,... n as test 
vectors. Take s from Lemma 16.11 and use EH) to compute 


s(Ufc, Uj ) 


and 


Sh(H~ 1/2 v k , n'~ 1 / 2 Uj) = (u fc ,H 1/2 H '- 1 / 2 u i ) - (H' 1/2 H- 1/2 u fc , u^j 

(^V7 1/2 -^ 1 V 2 )K,u J ) 


J] I (Vk,Uj) 
k^j 


< max 


nax 7 -----r > I s(vk, Ui)\ < max —---— 


< max 


fc/i (A 


A i k pLj sin 2 0 P 

ik - [if) 2 1 - sin @ p 
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Scaling Vj,Uj so that ( Vj,Uj ) > 0, we obtain 


l>j Uj 


< 


< 


V2 

V2 


i ■> u j ) 


1 1/2 


Vz 1 - [1 - I (vk,Uj) I 2 ] 

k~7tj 


21 1 / 2 ' 


1 1/2 


[1 


max 




sin 2 0 


—P-l 

{\ k ~ HjY 1 - Sill 0 p J 


1/2 


1/2 


x /2 sin 0 P WWjA 

— = max —-- 

V 1 — sin 0 P 


This proves the lemma in the case in which cr e (H) = 0. In the general case we use the 
formula 


\fKPj 
A e dj 


( [] y/A^, OOA ^Ejji/ 2 (A e , OO^ 'jUj 5 Uj 


and analogous argument. 


□ 


7. A SIMPLE MODEL PROBLEM 


We will now present an application of our theory to the singularly perturbed Sturm- 
Liouville eigenvalue problem m- Estimates EH and t 2J Theorem 6.21(Kato-Temple 
eigenvector estimate)] do not apply due to overly stringent regularity assumptions on the 
test vector, cf. flU Mnnp - 

Consider the family of positive definite forms 

(7.1) h v [u,v) — hb(u,v) + r] 2 h e (u,v) — J u'v'dx + if J u'v' dx, u,v £ Hq[ 0 , 2 ], 

By H, denote the positive definite operator which is defined by the form h v from (EH). 
We are interested in eigenvalues and eigenvectors of the operator H, ; for large 77 . Here, 
Hq [0, 2] denotes the first order Sobolev space with zero trace on the boundary. 

This is the eigenvalue problem for the vibration of a highly inhomogeneous string. We 
are only considering an academic example where we can efficiently compute all information 
we need. For more realistic applications see [Hi. 

If we identify the functions from Hq[0,oi], a > 0, with their extension by zero to the 
whole of [0, /3\ for (3 > a, then we can write 

(7.2) HZ[0,a]<zH%[0,p], 0 < a < (3. 


Let X[ 0 .i] be the characteristic function of the interval [0,1] and let X[o,i] c 
Keeping EH in mind, we conclude that 

H, ; = -d x (l + rj 2 X[o,i]c)d x , X>(H, ; ) = H 2 [ 0, 2] D H%[ 0, 2], 

It is known that the forms h v converge to the form 

hoo(u,v) = / u'v' dx, u,v G Hq [0,1] 


1 - X[o,i] - 
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Figure 1. Various test functions for and H^, 77 large. 


in the norm resolvent sense 4 . Operators H r/ and H M have discrete spectra and all the 
eigenvalues are nondegenerate, cf. m- Since we will be considering the whole family of 
operators H ?? , additional notation will be introduced to ease the understanding. By 

v < • ■ ■ < v < • • ■ 

we denote the increasingly ordered eigenvalues of the operator H r/ and by 

Af < ■ ■ ■ < A~ < ■ ■ ■ 


the eigenvalues of the operator Hoc. 

The eigenpairs of the operator H lDO —which is defined in L 2 [ 0,1]— are 
(n 2 7 r 2 , \/2sin(ranr)), n G N. The functions 


(7.3) 


U n {x) 


\/ 2 sin(ranr), 0 < x < 1 
0 , 1 < x 


n G N 


are in H ( \ [0,1] and also in Hq[0, 2]. Therefore, they can be used as test functions for an 
approximation of the eigenvalues of ( for large if). Furthermore, according to (|1 .71) we 
obtain 

• 2 r\ ( \ ■ 2 /xj- 1/2 xjl/2 \ (Wj, H Uj) — [Ui, 

sm 0„(ui) := sm 0(H ' m, H ' m) = - 7 - 75 - 7—7 -■ 

Let us now concentrate on the approximation of the lowest eigenvalue. We compute the 
Ritz value 

h^iu^uf) = 7r 2 . 

When sin0„(u!) < 1, Theorem 14.161 guarantees the existence of an eigenvalue Xf such that 


IV, - Af 
Af 


< sin Q v (ui) . 


4 More on the properties of this convergence can be found in mm- 
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A direct computation shows that 
- H/m/ 

'y (i + (i + h 2 ) 


'0 ho 

i\ 


(7.4) 


2 + T] 2 

x (l + (l + v 2 ) (l -y)) 

2 + T) 2 

= 0(y~ 2 ). 


— y (1 — x) ) sin(7rt/) sin(7rx) dy 


dx 


x (1 — 2 /) ) sin(7T7/) sin(7rx) dy 


(2 + y 2 )^ 2 

This establishes that sin@, ? (ui) —> 0, so Theorem 14.161 will be applicable for y > 1 such 


that 


(wi,H V) - (m^H/m/ 


< 1. 


KH-V) 4 T Tj 2 

Furthermore, based on HU and HH] we conclude that the assumptions of Theorem [HI 
must be satisfied for y large. We will now investigate this claim further. 

The eigenvalues of the operator H r; satisfy the equation 


(7.5) 


I ^rj 

■v/T+7cot(^) + cot (J ——g 

V 1 + ?r 


r) =°- 


and the nonnormalized eigenvectors are 

'sin(\/Afx), 
v?(x) = ( sin(y / Af) 


sin / 


A? 


/ / A? \ ' 1/ 1 + T) 2 

811 MVw) V 

Set Vi = ||u^|| -1 Vi then Theorems ED and 1(121 imply 

irlAf 


X 


0 < x < 1 
)> 1 — 


X 


sin Z(vi,ui) = Hu) 2 —Mi|| < 


An 


TT* 


From Q we establish the uniform estimate 

„ „ „ 1.333334 

\\V{ — Mi|| < 


4 + y 2 — a/8 + 2y 2 


\JA -\- y 2 — a /8 + 2?/ 2 


y > 2. 


This illustrate a way to obtain rigorous eigenvector estimates. First, we have localized 
the approximated eigenvalue by an application of Theorem 15.11 This has selected the 
approximated eigenvector. Theorem Id.21 then yields an accuracy of that approximation. 
Let us note that 

hoc {^rLi ^n) h v (u n , U n ) Tl 7\ 

(M n , H, ? M n ) (m ni H/m„) 2 


sin Q v (u n ) = 


(M n , H ^M n ) 


4 + y 2 
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This implies that we can get estimates for all A) 7 and v 7 by an analogous procedure. In es¬ 
tablishing the convergence results for higher eigenvalues and eigenvectors it was important 
that we a priory new that all X v were nondegenerate. Our theory has successfully been 
applied to similar singularly perturbed operators which were defined in L 2 (fl), 0 C R n , 
see |Hj. For those operators such a claim does not hold. There it is important to gener¬ 
alize the subspace results from [9] as well as to obtain higher order estimates (in sin©,^) 
for eigenvalues. These results were obtained in the PhD. Thesis [8] and will be reported 
elsewhere. 


8. Conclusion 

A method to compute an estimate of the accuracy of the subspace approximation method 
is presented. It can also be used to obtain accurate lower estimates of a desired group 
of eigenvalues. The bounds have to be viewed as a combination of the Ritz value bound, 
which gives an existence of the matching of the Ritz values and eigenvalues, and the 
subspace bound, which describes the nature of that matching. 

The case study that was just performed can be described as leading to a “pseudo spec¬ 
tral” method. We have used the completely solvable (“well behaved”) operator 

(H^V H^ 2 u) = hoo(u, v) = f u'v' dx, u, v G H] [0,1], 

Jo 

to analyze the singularly perturbed operator H (/ . Since the eigenvalue problem for the 
operator was completely solvable, we have used the eigenfunctions of the operator 
Hoc to define a test space for the operator H,,. Analogously, we could have used other test 
functions from A/q [0,1] to analyze the operator H^. For instance, assume we have used 
the linear finite elements to compute an approximation Ui of the function tq, see Figure |U 
Theorem 14. 1 hi can be invoked if we find a way to estimate sin@(H r ( 1 ^ 2 Mj, h ; /2 Ui). The study 
of singularly perturbed eigenvalue problems and finite element spectral approximations has 
been performed in [8]. The results will be presented in subsequent reports. 
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